Decision Forest: Combining the Predictions of Multiple Independent Decision Tree Models
نویسندگان
چکیده
The techniques of combining the results of multiple classification models to produce a single prediction have been investigated for many years. In earlier applications, the multiple models to be combined were developed by altering the training set. The use of these so-called resampling techniques, however, poses the risk of reducing predictivity of the individual models to be combined and/or over fitting the noise in the data, which might result in poorer prediction of the composite model than the individual models. In this paper, we suggest a novel approach, named Decision Forest, that combines multiple Decision Tree models. Each Decision Tree model is developed using a unique set of descriptors. When models of similar predictive quality are combined using the Decision Forest method, quality compared to the individual models is consistently and significantly improved in both training and testing steps. An example will be presented for prediction of binding affinity of 232 chemicals to the estrogen receptor.
منابع مشابه
Comparison of gestational diabetes prediction with artificial neural network and decision tree models
Background: Gestational diabetes mellitus (GDM) is one of the most common metabolic disorders in pregnancy, which is associated with serious complications. In the event of early diagnosis of this disease, some of the maternal and fetal complications can be prevented. The aim of this study was to early predict gestational diabetes mellitus by two statistical models including artificial neural ne...
متن کاملComparison of disability score estimation in multiple sclerosis patients with artificial neural network and decision tree models
Background: Multiple Sclerosis (MS) is one of the most debilitating disease among young adults. Understanding the disability score (Expanded Disability Status Scale (EDSS)) of these patients is helpful in choosing their treatment process. Calculating EDSS takes a lot of time for Neurologists, so having a way to estimate EDSS can be helpful. This study aimed to estimate the EDSS score of MS pati...
متن کاملComparison of Gestational Diabetes Prediction Between Logistic Regression, Discriminant Analysis, Decision Tree and Artificial Neural Network Models
Background and Objectives: Gestational Diabetes Mellitus (GDM) is the most common metabolic disorder in pregnancy. In case of early detection, some of its complications can be prevented. The aim of this study was to investigate early prediction of GDM by logistic regression (LR), discriminant analysis (DA), decision tree (DT) and perceptron artificial neural network (ANN) and to compare these m...
متن کاملRanking stocks of listed companies on Tehran stock exchange using a hybrid model of decision tree and logistic regression
Much research has introduced linear or nonlinear models using statistical models and machine learning tools in artificial intelligence to estimate Iran's rate of return. The primary purpose of these methods is simultaneously use different independent variables to improve stock return rates' modeling. However, in predicting the rate of return, in addition to the modeling method, the degree of co...
متن کاملTopological Models for Prediction of Pharmacokinetic Parameters of Cephalosporins using Random Forest, Decision Tree and Moving Average Analysis
The topological indices were used to encode the structureal features of cephalosporins. Both topostructural and topochemical versions of a distance based descriptor, three adjacency based descriptors and five distance-cum-adjacency based descriptors were calculated. The values of 18 indices for each cephalosporin in the dataset were computed using an in-house computer program. Multiple pharmaco...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of chemical information and computer sciences
دوره 43 2 شماره
صفحات -
تاریخ انتشار 2003